Zipf's law in phonograms and Weibull distribution in ideograms: comparison of English with Japanese.

Authors

  • Terutaka Nabeshima
  • Yukio-Pegio Gunji
Abstract

The frequency distribution of word usage in a word sequence generated by capping is estimated from the number of "hits" returned when retrieving web pages, in order to evaluate semantic structure that is proper to a language rather than to a particular text. In particular, we compare the distributions for English sequences with those for Japanese ones. For English and for Japanese phonograms, the frequency of word usage against rank follows a power-law function with exponent 1; for Japanese ideograms, it follows a stretched exponential (Weibull distribution) function. We also discuss how this difference can result from the difference between a phonogram-based language (English) and an ideogram-based language (Japanese).
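The two rank-frequency forms named in the abstract can be illustrated with a minimal sketch. The parameterization below (amplitude, scale, shape), the function names, and the synthetic data are assumptions made for illustration; they are not the authors' data or fitting procedure. A power law is a straight line in log-log coordinates, while a stretched exponential with known amplitude A becomes a straight line when ln(ln(A / f)) is plotted against ln(rank).

```python
import numpy as np

def zipf(rank, amplitude, exponent=1.0):
    """Power-law rank-frequency curve: f(r) = amplitude * r**(-exponent)."""
    return amplitude * rank ** (-exponent)

def stretched_exponential(rank, amplitude, scale, shape):
    """Stretched exponential (Weibull-type) curve:
    f(r) = amplitude * exp(-(r / scale)**shape)."""
    return amplitude * np.exp(-(rank / scale) ** shape)

def fit_power_law_exponent(ranks, freqs):
    """Estimate the power-law exponent as minus the slope of
    log(frequency) against log(rank)."""
    slope, _ = np.polyfit(np.log(ranks), np.log(freqs), 1)
    return -slope

def fit_stretched_exponential(ranks, freqs, amplitude):
    """Estimate (scale, shape), assuming the amplitude is known.
    Uses ln(ln(A / f)) = shape * ln(r) - shape * ln(scale)."""
    y = np.log(np.log(amplitude / freqs))
    shape, intercept = np.polyfit(np.log(ranks), y, 1)
    scale = np.exp(-intercept / shape)
    return scale, shape

# Synthetic, noise-free data (hypothetical parameter values).
ranks = np.arange(1, 1001, dtype=float)
zipf_freqs = zipf(ranks, amplitude=1000.0, exponent=1.0)
weibull_freqs = stretched_exponential(ranks, amplitude=1000.0, scale=50.0, shape=0.5)

zipf_exponent = fit_power_law_exponent(ranks, zipf_freqs)
weibull_scale, weibull_shape = fit_stretched_exponential(ranks, weibull_freqs, 1000.0)
print("power-law exponent:", zipf_exponent)
print("stretched-exponential scale and shape:", weibull_scale, weibull_shape)
```

With real hit counts, the amplitude would itself have to be estimated, and a nonlinear fit would likely be more robust than these linearized regressions.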

Similar resources

Ideographic Alexia without Involvement of the Fusiform Gyrus in a Korean Stroke Patient: A Serial Functional Magnetic Resonance Imaging Study

The Korean orthographic system consists of both phonograms (Hangul) and ideograms (Hanja). Hangul is a phonetic alphabet comprised of consonants and vowels that are grouped together to form syllables that generally exhibit regular correspondences between graphemes and phonemes. On the other hand, Hanja is derived from complex Chinese characters with distinct meanings. In this respect, Hanja and...

Effects of Related Term Extraction in Transliteration into Chinese

To transliterate foreign technical terms and proper nouns, in Japanese and Korean, phonograms, such as Katakana and Hangul, are used. In Chinese, the pronunciation of a source word is spelled out with Kanji characters. However, because Kanji comprises ideograms, different Kanji are associated with the same pronunciation, but can potentially convey different meanings and impressions. In this pap...

Random texts exhibit Zipf's-law-like word frequency distribution

It is shown that the distribution of word frequencies for randomly generated texts is very similar to Zipf's law observed in natural languages such as English. The facts that the frequency of occurrence of a word is almost an inverse power law function of its rank and the exponent of this inverse power law is very close to 1 are largely due to the transformation from the word's length to it...

Differential roles of spatial frequency on reading processes for ideograms and phonograms: A high-density ERP study

The neural substrate of the dissociation between reading Japanese ideograms (Kanji) and phonograms (Kana) is currently unclear. To test whether spatial frequency (SF) information is responsible for this phenomenon, we recorded high-density event-related potentials (ERPs) with unfiltered or spatially filtered word stimuli in Japanese-speaking subjects. Kanji (early-learned, late-learned), Kana (...

Comparison of three Estimation Procedures for Weibull Distribution based on Progressive Type II Right Censored Data

In this paper, based on progressive type II right censored data, we consider the MLE and AMLE estimates of the scale and shape parameters of the Weibull distribution. A new type of parameter estimation, named inverse estimation, which uses properties of order statistics, is also introduced for both the shape and scale parameters of the Weibull distribution. We use simulations and study the biases a...


Journal title:
  • Bio Systems

Volume 73, Issue 2

Pages: -

Publication date: 2004